CART-based duration modeling using a novel method of extracting prosodic features

نویسندگان

  • Paul Deans
  • Andrew P. Breen
  • Peter Jackson
چکیده

The prediction of accurate segmental durations remains a difficult problem when synthesising speech from text. Inaccurate durations are often perceptually prominent and detract from the naturalness of the quality of speech. For a concatenative system, a statistical approach is an excellent way of predicting segmental durations. More specifically a CART (Classification And Regression Tree) method is appropriate [1], but only if it has been correctly trained with data that reflects a phoneme’s characteristics. A feature-set is used to describe the flavour of a phoneme in the process of building of CART trees. We describe a novel method where BT’s Laureate Text-to-Speech system (TTS) is used to automatically donate the prosodic information required to make up the feature-set, ultimately being used as training data for building a CART tree. This tree, in turn, is used to predict segmental durations. The extraction of salience (derived from a metrical analysis of the text) and the other prosodic and segmental features in this way, is a novel concept. CART trees consistently show that this salience feature, in particular, has a large effect on the duration of a phoneme. The paper describes in detail this concept and shows the importance of salience. An evaluation of the effectiveness of CART-based duration modelling against the rule-based Laureate TTS method is given in the results.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Novel Method for Tracking Moving Objects using Block-Based Similarity

Extracting and tracking active objects are two major issues in surveillance and monitoring applications such as nuclear reactors, mine security, and traffic controllers. In this paper, a block-based similarity algorithm is proposed in order to detect and track objects in the successive frames. We define similarity and cost functions based on the features of the blocks, leading to less computati...

متن کامل

Polynomial regression model for duration prediction in Mandarin

Duration modeling is to establish a mapping relationship between the prosodic context and the segmental duration engendered in natural speech. In this paper, we first study the effect of prosodic features on segmental duration of neutral utterance in Mandarin by introducing a statistical concept--eta squared, then choose more forceful prosodic features and design interaction quantifying algorit...

متن کامل

A CART approach for Duration Modeling of Greek Phonemes

This paper describes the construction and evaluation of a segmental duration prediction model for Greek language with the application of CART (Classification and Regression Tree) machine learning approach. A ToBI annotated prosodic speech corpus was utilized for the construction of training and testing sets. Our phoneme category was composed of 34 phonemes distributed in 32.072 instances (in 5....

متن کامل

Prosody modelling in Czech text-to-speech synthesis

This paper describes data-driven modelling of all three basic prosodic features – fundamental frequency, intensity and segmental duration – in the Czech text-to-speech system ARTIC. The fundamental frequency is generated by a model based on concatenation of automatically acquired intonational patterns. Intensity of synthesised speech is modelled by experimentally created rules which are in conf...

متن کامل

A Study of the Relationship between Acoustic Features of “bæle” and the Paralinguistic Information

Language users benefit from special phonetic tools in order to communicate linguistic information as well as different emotional aspects and paralinguistic information through daily conversation. Having functions in conveying semantic information to listeners, prosodic features form the essential part of linguistic behavour, manipulating  them potentially can play an important role in transmitt...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1999